Hello,
I am new here and have run into a problem while preparing regressions on a knowledge score variable. I don't have access to the do-file that the previous biostatistician used to create the `score_knowledge` variable.
The author has since added responses to additional knowledge questions and asked me to score them and fold them into the existing `score_knowledge` variable.
Here’s what I know:
1. **Example of an individual knowledge question**
- `k10_correct` is a recoded variable from `k10_expdate_12m` (Knowledge Q10).
- It's a numeric variable (byte) coded `0 = Wrong`, `1 = Correct`.
2. **Codebook of the `score_knowledge` variable**
- Type: Numeric (float)
- Range: 0–90 (units: 10, i.e. every observed value is a multiple of 10)
- Mean: 45.7251, Std. Dev: 16.6574
My Question:
Is it correct to calculate the updated knowledge score by:
1. Summing the correct answers across all knowledge questions (existing and new),
2. Then dividing by the total number of knowledge questions to get the proportion correct?
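To make the proposal concrete, here is a minimal sketch of the calculation I have in mind. It is written in Python rather than as a do-file only so that it is self-contained. The function name `proportion_correct`, the 12-item respondent, and the rule that any missing item makes the whole score missing are all my own assumptions, not something taken from the original coding.

```python
def proportion_correct(item_flags):
    """Share of knowledge items answered correctly.

    item_flags: one 0/1 value per question (like k10_correct),
    or None where the answer is missing.
    Assumption: a respondent with any missing item gets no score.
    """
    if any(flag is None for flag in item_flags):
        return None
    return sum(item_flags) / len(item_flags)


# Hypothetical respondent with 12 items (existing plus new questions).
respondent = [1, 0, 1, 1, 0, 1, 0, 1, 1, 0, 1, 1]
share = proportion_correct(respondent)

print(share)        # 8 correct out of 12
print(100 * share)  # same result expressed on a 0-100 scale
```

Note that this gives a value between 0 and 1 (or 0 and 100 after rescaling), which is not on the same scale as the current 0–90 variable unless the old score was built the same way.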
Alternatively, is it possible to reconstruct or modify the original encoding logic for the `score_knowledge` variable to ensure consistency?
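One way I could test a guess about the original logic is to rebuild the score from the existing `*_correct` items and compare it with the stored value for every respondent. The sketch below, again in Python only for illustration, assumes the old score was 10 times the number of correct answers. That multiplier is purely my guess from the codebook (multiples of 10, range 0–90); the function name `find_mismatches` and the sample rows are hypothetical.

```python
def find_mismatches(rows, multiplier=10):
    """Compare a rebuilt score with the stored one.

    rows: list of (item_flags, stored_score) pairs, where item_flags
    holds the 0/1 values of the original knowledge items only.
    multiplier: assumed points per correct answer (a guess, not confirmed).
    Returns (row_index, rebuilt, stored) for every row that disagrees.
    """
    mismatches = []
    for index, (item_flags, stored_score) in enumerate(rows):
        rebuilt = multiplier * sum(item_flags)
        if rebuilt != stored_score:
            mismatches.append((index, rebuilt, stored_score))
    return mismatches


# Hypothetical data: two respondents, three original items each.
sample = [
    ([1, 0, 1], 20.0),  # rebuilt 20, agrees with stored value
    ([1, 1, 1], 20.0),  # rebuilt 30, disagrees with stored value
]
print(find_mismatches(sample))
```

If the list of mismatches is empty across the full dataset, the guessed rule reproduces `score_knowledge`, and I could apply the same rule to the expanded set of questions. If not, the original coding must have used different items or weights.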
Thank you for your guidance!